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GENERATION OF RECOMBINENT GENES IN 
SACCHAROMYCES CEREVISIAE 

Description 

The present Invention relates in general to methods for generating 
and detecting recombinant DNA sequences in Saccharomyces cere- 
visiae and plasmids and S. cerevisiae cells used for conducting the 
5 inventive methods. 

DNA sequences for which these methods are relevant include pro- 
tein-encoding and non-coding sequences; they may also consist of 
larger continuous stretches that contain more than a single coding 
sequence with intervening non-coding sequences, such as those 
1 0 that as may belong to a blosynthetic pathway. 

The miCTobial and enzymatic production of substances such as en- 
zymes and other proteins Is an Important economical topic. Enzymes 
are biocatalytically active proteins not only responsible for the me- 
tabolism of natural compounds and organisms, but also utilized for 

15 the industrial production of natural and non-natural compounds. En- 
zymes or those compounds produced by the help of enzymes can be 
used for the production of drugs, cosmetics, foodstuffs, etc. How- 
ever, the industrial use of enzymes has been greatly hindered by 
their target specifity and the specific conditions under which they can 

20 function- Other. proteins have therapeutic applications in the fields of 
human and animal health. Important classes of medically important 
proteins include cytokines and growth factors. 

Proteins, enzymes, and pathways with novel or improved functions 
and properties can be obtained either by searching among largely 
25 unknown natural species or by improving upon currently known natu- 
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ral proteins or enzymes. The latter approach may be more suitable 
for creating properties for which natural evolutionary processes are 
unlikely to have been selected. 

One promising strategy to create such novel desirable properties 
5 and to redesign enzymes, other proteins, non-coding sequences or 
pathways is by directed molecular evolution. Conventionally, as di- 
rect evolution of DNA sequences has been achieved with such tech- 
niques as site-directed mutagenesis, multi-site or cassette 
mutagenesis, random mutagenesis, and error prone PGR. Recently, 
10 gene shuffling approaches to optimize or fine-tune the properties of 
enzymes or proteins have attracted much attention. These directed 
evolutionary techniques can produce enzymes that can improve ex- 
isting technology, produce novel products and expand the capabili- 
ties of synthetic chemistry. 

15 A number of different mutagenesis methods exist, such as random 
mutagenesis, site-directed mutagenesis, oligonucleotide cassette 
mutagenesis, or point mutagenesis by error-prone PGR. Random 
mutagenesis, for example, entails the generation of a large number 
of randomly distributed, nucleotide substitution mutations in cloned 

20 DNA fragments by treatment with chemicals such as nitrous acid, 
hydrazine, etc. Enror-prone PGR has been developed to introduce 
random point mutations into cloned genes. Modifications that de- 
crease the fidelity of the PGR reaction include increasing the con- 
centration of MgGIa, adding MnGla., or altering the relative concentra- 

25 tions of the four dNTPs. 

These traditional mutagenesis methods focus on the optimization of 
individual genes having discrete and selectable phenotypes. The 
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general strategy is to clone a gene, identify a discrete function for 
the gene, establish an assay by which it can be monitored, mutate 
selected positions in the gene and select variants of the gene for 
improvement in the known function of the gene. A variant having im- 
5 proved function can then be expressed in a desired cell type. 
Repetitive cycles of mutagenesis methods can be carried out to ob- 
tain desirable enzyme properties. 

Each of these conventional approaches has an implicit sequence 
search strategy. The strategies employed in the above techniques of 

10 sequence searching are very different. Performing a saturating site- 
directed mutagenesis search involves a process of installing every 
possible permutation at a site of interest. For a protein, this proce- 
dure consists of repladng an amino acid at a site of interest with all 
1 9 other amino acids and searching the resultant library for improved 

15 mutants, in sequence space terms this means that a very small re- 
gion has been searched very thoroughly. In comparison, cassette- 
mutagenesis inserts a random peptide sequence in a specific region 
of a protein, giving a less thorough sampling of a larger, defined re- 
gion of sequence space. Enror-prone PGR involves repeated copying 

20 of a sequence, with the introduction of a low but significant number 
of errors. In this case, a sparse sampling of a less defined region of 
sequence space is achieved. In each of these strategies, the best 
mutant obtained in each round of selection is used to initiate the next 
round. 

26 However, traditional mutagenesis approaches for evolving new prop- 
erties in enzymes have a number of limitations. First, they are only 
applicable to genes or sequences that have been cloned and func- 
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functionally characterired Second, these approaches are usually 
applicable only to genes that have a discrete function. Therefore, 
multiple genes that cooperatively confer a single phenotype usually 
cannot be optimized in this manner. Finally, these approaches can 
5 only explore a very limited number of the total number of permuta- 
tions, even for a single gene. In view of these limitations, conven- 
tional mutagenesis approaches are inadequate for improving cellular 
genomes with respect to many useful properties. For example, 
improvements in the capacity of a cell to express a protein might re- 

10 quire alterations in transcriptional efficiency, translation and post- 
translatlonal modifications, secretion or proteolytic degradation of a 
gene product It therefore might be necessary to modify additional 
genes having a role in one or more of these cellular mechanisms in 
order to express a protein with new properties. Attempting to Indi- 

15 vidually optimize all of the genes having such function would be a 
virtually impossible task. 

Most of the problems associated with conventional mutagenesis ap- 
proaches can be overcome by gene shuffling approaches. Gene 
shuffling entails randomly recombining different sequences of func- 

20 tionai genes, enabling the molecular mixing of naturally similar or 
randomly mutated genes. DNA or gene shuffling, or variations of 
these techniques, have been used to improve the activity, stability, 
folding, and substrate recognition properties of enzymes. In com- 
parison to conventional mutagenesis approaches with gene shuf- 

25 fling, the probability of obtaining mutants with improved phenotype is 
significantly higher. Gene shuffling is fundamentally different from 
conventional strategies in that it recombines favorable mutations in a 
combinatorial fashion. It therefore will search much larger regions of 
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sequence space much more sparsely and with a bias towards pro- 
ducing functional sequences. It also allows more beneficial muta- 
tions from each round of selection to be retained in the next round 
because it allows sequence information to be contributed from more 
5 than one source. Whereas conventional strategies also allow for the 
fixation of negative mutations, this is not the case for gene shuffling 
approaches. Therefore, it is not surprising that gene shuffling strate- 
gies have yielded much more dramatic results. 

DNA or gene shuffling approaches are based on recombination 

10 events between regions with a certain homology or between 
stretches of identity. A key organism used in experiments to examine 
genetic recombination in eukaryotes has been the budding yeast 
Saccharomyces cerevisiae. The study of these processes in a sim- 
ple, unicellular organism has the obvious advantage of the ease of 

15 manipulation of DNA sequences and the possibility of studying spe- 
cific recombination events induced synchronously in a large propor- 
tion of cells. Furthermore, over the last few decades a wealth of ex- 
pertise has been accumulated both in the fermentation technology 
and the basic genetics of this organism, which is at present the best 

20 studied eukaryote at the molecular level. Due to its non-pathogenic 
character, its secretion proficiency and its glycosylation potential, S. 
cerevisiae is a preferred host organism for gene cloning and gene 
expression. Therefore, the technical problem underlying the present 
invention is to provide methods and means for the generation of re- 

25 combinant mosaic genes in Saccharomyces cerevisiae. 
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The present invention solves this underlying technical problem by 
providing a process for generating and detecting recombinant DMA 
sequences in Saccharomyces cerevisiae comprising the steps of: 

a) generating first diploid S. cerevisiae cells bearing in a defined 
5 locus of their genome a first recombination cassette compris- 
ing a first DNA sequence to be recombined, whidi is flanked 
by at least a first and a second marker sequence, in an allelic 
position a second recombination cassette comprising a sec- 
ond DNA sequence to be recombined, which is flanked by at 

1 0 least a third and a fourth marker sequence, 

b) inducing the sporulation of the first diploid cells obtained in a) 
and 

c) isolating haploid ceils containing recombination cassettes in 
which first recombined DNA sequences are flanked by the 

15 first and fourth marker sequences, and haploid cells contain- 

ing recombination cassettes in which second recombined 
DNA sequences are flanked by the second and the third 
marker sequences. 

The present invention provides a yeast-based system to screen for 
20 recombination events between at least two diverging DNA se- 
quences. The system is based on the sexual reproductive cycle of S. 
cerevisiae, which alternates between a haploid phase and a diploid 
phase. In the first step of the inventive process, diploid S. cereivisiae 
cells are generated, which are heterozygous for these recombination 
25 substrates. The DNA sequences to be recombined are integrated in 
the genome of the diploid S. cerevisiae cells at allelic positions. 
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Each DNA sequence to be recomblned is integrated in the fonm of a 
recombination cassette, which comprises besides this DNA se- 
quence at least two marker sequences that flank the DNA sequence, 
whereby the two recombination cassettes comprise at least four dif- 
5 ferent marker sequences. 

The heterozygous diploid cells thus obtained are then grown under 
conditions which induce the processes of meiosis and spore fonma- 
tion. Meiosis is generally characterized by elevated frequencies of 
genetic recombination, which is initiated via the formation and sub- 

10 sequent repair of double-strand breaks (DSBs) induced early in 
meiosis I prophase. Yeast meiotic cells are therefore of particular 
interest because they experience high levels of recombination as a 
result of the genome-wide induction of DSBs. Thus the products of a 
first round of meiosis, which are haploid cells or spores for each 

15 meiosis event four produced by a parental diploid cell, can contain 
recombined DNA sequences due to recombination between the .two 
diverged DNA sequences. 

Recombination between the two diverging DNA sequences during 
meiosis can also lead to an exchange of the flanking marker se- 

20 quences. Therefore, the present process allows a rapid and simple 
identification of recombined DNA sequences by the selection of indi- 
vidual cells or molecules in which an exchange of marker sequences 
flanking a recombination substrate has taken place. The recombi- 
nants obtained after the first round of meiosis are therefore charac- 

25 terized in that they contain and/or express at least one marker se- 
quence of the first recombination cassette and at least one marker 
sequence of the second recombination cassette. In particular, re- 
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combinant spores can contain the first marker sequence of the first 
recombination cassette and the fourth marker sequence of the sec- 
ond recombination cassette or the second marker sequence of the 
first recombination cassette and the third marker sequence of the 
6 second recombination cassette, whereby both types of recombinant 
spores contain besides this different marker combination also differ- 
ent recombinant DNA sequences. Both types of spores containing 
recombinant sequences can easily be selected and distinguished 
under conditions that permit selection for the new recombinant 
10 marker configurations produced by recombination during meiosis. 

The inventive process can be conducted either in wild-type or mis- 
match repair-defective S. cerevisiae cells. The processes by which 
damaged DNA is repaired and the mechanisms of genetic recombi- 
nation are intimately related, and it is known that the mismatch repair 

15 machinery has inhibitory effects on the recombination frequency be- 
tween divergent sequences, i.e. homeologous recombination. Muta- 
tions of the mismatch repair system therefore greatly enhance the 
overall frequency of recombination events in yeast On the other 
hand, it is known that wild-type S. cere\^siae cells have a mismatch 

20 repair-dependent recombination mechanism, which is based on dis- 
tantly spaced mismatches in two recombination substrates. Depend- 
ing on the DNA sequences to be recombined, either wild-type or 
mismatch repair-defective S. cerevisiae cells can be used to obtain 
recombined sequences. 

25 The Inventive process has the advantage that it is iterative, i.e. it 
allows further rounds of recombination. The products of the first 
round of meiosis, i.e. haploid cells of opposite mating types which 
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comprise different recombined DMA sequences, are mated again to 
obtain diploid cells which are heterozygous for recombined DNA se- 
quences. In the diploid cells thus obtained meibsis is again induced, 
whereby the recombined DNA sequences are once again recom- 
5 bined, leading again to an exchange of the two markers flanking 
each recombination substrate. The new haploid recombinants ob- 
tained after the second mefosis can now be easily identified by the 
joint expression of either \hose marker genes which flanked the first 
DNA sequence in the original first recombination cassette or those 
10 marker genes which flanked the second DNA sequence in the origi- 
nal second recombination cassette. 

In a preferred embodiment of the invention therefore haploid cells 
containing a recombination cassette with the first recombined DNA 
sequences obtained in the first round of the inventive process are 

15 mated with haploid cells containing recombination cassettes with the 
second recombined DNA sequences obtained in the first round of 
the inventive process in order to generate second diploid cells. In 
the thus obtained second diploid cell sporulation is induced, result- 
ing in the generation of haploid cells. In the next steps haploid cells 

20 containing recombination cassettes in which third recombined DNA 
sequences are flanked by at least the first and second marker se- 
quences and haploid cells containing recombination cassettes in 
which fourth recombined DNA sequences are flanked by at least the 
third and fourth marker sequences are isolated. 

25 Further recombined DNA sequences can be generated by subjecting 
the haploid cells containing third and fourth recombined DNA se- 
quences to one or more further cycles of mating and meio- 
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sis/sporulation. After each round of recombination, recombinants are 
either identified by the joint presence of at least one mariner se- 
quence that flanked the first recombination substrate and at least 
one marker sequence that flanked the second recombination sub- 
5 strate or by the joint presence of the two markers that flanked the 
first or the second DNA sequence in the starting recombination sub- 
strates. 

Therefore, an advantageous feature of the present process is that it 
is iterative: recombinant haploid progeny is selected individually or 

10 en masse and mated to one another, the resulting diploids are 
sporulated anew, and their progeny spores are subjected to appro- 
priate selection conditions to identify new recombination events. 
With the inventive process a large library of recombined, mutated 
sequences can be easily generated, and variants that have acquired 

15 a desired function can then be identified by using an appropriate 
selection or screening system. 

In a preferred embodiment of the invention the first diploid S. cere- 
visiae cell is generated by simultaneously or sequentially transform- 
ing a diploid S. cerevisiae cell with a DNA molecule comprising the 

20 first recombination cassette and a DNA molecule comprising the 
second recombination cassette and optionally allowing the Integra- 
tion of the two recombination cassettes into allelic positions on natu- 
ral chromosomes of the S. cerevisiae genome. The DNA molecules 
used can also be for example yeast artificial chromosomes (YAC). 

25 YACs are characterized in that they are linear DNA molecules that 
contain all the sequences necessary for stable maintenance in the 
yeast cell, such as a centromere, DNA replication origin and te- 
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lomeres as well as yeast selectable markers. Upon Introduction into 
a yeast cell YACs behave similar to natural chromosomes and there- 
fore can be considered as part of the yeast genome. In the context of 
the present invention the term "genome'' includes the whole of all 
5 hereditary components present within a cell, which are stably main- 
tained and Inherited. In case YACs are used as DNA molecules for 
introduction the first and second recombination cassettes into diploid 
S. cerevisiae cells it is not necessary to integrate the two recombina- 
tion cassettes Into allelic positions in natural chromosomes. In the 

10 case in which recombination cassettes are introduced into natural 
chromosomes, it Is possible to use a cloning vehicle, for example a 
plasmid, from which a fragment bearing the recombination cassettes 
can be liberated. Preferably the two respective marker sequences of 
the two recombination cassettes are flanked by targeting sequences 

15 which are homologous to a defined locus of the S. cerevisiae ge- 
nome. Alternatively, a DNA molecule can be used which does not 
contain a replication origin. In this case the DNA molecules must be 
able to integrate into a component of the genome and therefore con- 
tain targeting sequences which are homologous to a defined locus of 

20 the S. cerevisiae genome. 

In another preferred embodiment of the invention the first diploid S. 
cerevisiae cells are generated by fusing haploid cells bearing in a 
locus of their genome the first recombination cassette with S. cere- 
visiae haploid cells bearing in an allelic position of their genome the 
25 second recombination cassette. 

In still another preferred embodiment of the invention the first diploid 
S. cerevisiae cells are generated by mating haploid cells bearing in 
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a locus of their genome the first recombination cassette with S. cere- 
visiae haploid cells of opposite mating type bearing in an allelic posi- 
tion of their genome the second recombination cassette. 

In the context of the present invention the terms "mating" and "fus- 
5 ing" denote either the purposeful or the random combination of two 
haplo'rd cells containing different recombination cassettes. A pur- 
poseful mating or fusing of two haploid cells occurs, when two se- 
lected and//or isolated haploid cells of opposite mating type with de- 
sired properties are brought into contact under conditions stimulating 

10 mating and fusing, respectively. The two haploid cells can be de- 
rived from the same library of cells, which for example contain DMA 
sequences to be recombined or already recombined DMA se- 
quences, or from different libraries of cells, which for example con- 
tain DNA sequences to be recombined or already recombined DNA 

15 sequences. 

A random mating or fusing of two haploid cells can occur, when a 
plurality of different haploid cells are brought Into contact under con- 
ditions stimulating mating and fusing, respectively. The plurality of 
haploid cells can be derived from the same library of cells, which for 
20 example contain DNA sequences to be recombined or already re- 
combined DNA sequences, or from different libraries of cells, which 
for example contain DNA sequences to be recombined or already 
recombined DNA sequences. 

The inventive process for generating and detecting recombined DNA 
25 sequencies has the advantage that more than two diverging se- 
quences can be recombined. If, for example, four diverging DNA se- 
quences shall be recombined, then in the first step of the present 



wo 2005/075654 



13 



PCT/EP2005/000841 



process two different sets of diploid S. cerevisiae cells can be gen- 
erated. For example, a first set of diploid cells can be generated by 
mating or fusing haploid cells comprising a first and a second DMA 
sequence to be recombined and a second set of diploid cells can be 
5 generated by mating or fusing haploid cells comprising a third and a 
fourth DNA sequence to be recombined. After sporulation of the two 
sets of diploid cells haploid cells obtained from the first diploid ceil 
set that contain recombined DNA sequences due to recombination 
between the first and the second DNA sequence, are mated with 

10 appropriate haploid cells obtained from the second diploid cell set 
that contain recombined DNA sequences due to recombination be- 
tween the third and the fourth DNA sequences. The products of this 
mating are diploid cells which after sporulation give rise to haploid 
cells bearing recombined DNA sequences which comprise regions of 

15 the first DNA sequence, the second DNA sequence, the third DNA 
sequence and the fourth DNA sequence. If, for example, three di- 
verging DNA sequences shall be recombined, In the first step of the 
present process diploid S. cerevisiae cells are generated by, for ex- 
ample, mating or fusing haploid cells comprising a first and a second 

20 DNA sequence to be recombined. After sporulation of these diploid 
cells, the haploid cells thus obtained, which contain recombined 
DNA sequences due to recombination between the first and the sec- 
ond DNA sequences, can be fused or mated with haploid cells com- 
prising a third DNA sequence to be recombined. The products of this 

25 mating are diploid cells which after sporulation give rise to haploid 
cells bearing recombined DNA sequences which comprise regions of 
the first DNA sequence, the second DNA sequence and the third 
DNA sequence. In this way, five, sbc or more diverging DNA se- 
quences can also be recombined. 
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In a preferred embodiment haploid S. cerevisiae cells bearing the 
first or second recombination cassette are generated by: 

a) inserting the first DNA sequence to be recombined between 
the first and the second mariner sequence located adjacently 

5 on a first cloning vehicle and inserting the second DNA se- 

quence to be recombined between the third and the fourth 
marker sequence located adjacently on a second cloning ve- 
hicle, whereby the respective t^o marker sequences are 
flanked by targeting sequences which are homologous to a 
1 0 defined locus of the S. cerevisiae genome, 

b) excising from the cloning vehicles obtained in a) the first re- 
combination cassette and the second recombination cassette 
with flanking targeting sequences, respectively, whereby each 
excised fragment comprises the DNA sequence to be recom- 

15 bined, which is flanked by the respective two marker 

sequences and by targeting sequences, 

c) transforming the excised fragments obtained in b) separately 
into S. cerevisiae diploid cells, whereby the targeting se- 
quences direct the integration of the cassettes into that locus 

20 to which they are homologous, in order to obtain diploid cells 

heterozygous for the first cassette, or the second cassette, 

d) inducing separately the sporulation of the heterozygous dip- 
loid cells obtained in c) and 

e) isolating haploid cells containing the first cassette flanked by 
25 the first and second marker sequences and separately hap- 
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loid cells containing the second cassette flanked by the third 
and the fourth marker sequences. 

In a preferred embodiment of the invention the respective two marker 
sequences in the first or second cloning vehicle are flanked by tar- 
6 gating sequences which are homologous to the BUDSI-HCMI locus 
on chromosome III of the S. cerevisiae genome and which direct the 
Integration of the excised cassettes into that locus. 

In a preferred embodiment of the invention the cloning vehicle used 
for cloning the recombination cassettes is a plasmid. "Plasmitf" 
10 means an extrachromosomal element which can autonomously repli- 
cate. The plasmid is physically unlinked to the genome of the cell 
wherein it is contained. Most plasmids are double-stranded circular 
DNA molecules. In another embodiment the cloning vehicle is an 
YAC. 

16 In particular it is preferred to use as the first cloning vehicle, in which 
the first recombination cassette is cloned, plasmid pMXY9. Plasmid 
pMXY9 comprises the URA3 marker gene and the CAN1 marker 
gene. In this plasmid the two marker genes are adjacently located. 
Between the two marker genes are arranged several restriction sites, 

20 in particular recognition sites for the restriction enzymes Smal, Xbal, 
8g/ll and Pad, for inserting a DNA sequence to be recombined. The 
two marker sequences are flanked by targeting sequences homolo- 
gous to the BUD31'HCM1 locus on chromosome III of the S. cere- 
visiae genome. 

25 Furthermore, it is preferred to use as the second cloning vehicle, m 
which the second recombination cassette is cloned, plasmid 



wo 2005/075654 



16 



PCT/EP2005/000841 



pMXY12. Plasmid pMXY12 comprises the TRP1 mari<er gene and 
the CYH2 marker gene. In this plasmid the two mariner genes are 
adjacently located. Between the two genes are anranged several re- 
striction sites, in particular recognition sites for the restriction en- 
5 zymes Spel, Smd\ and Pad, for inserting a DNA sequence to be re- 
combined. The two marker sequences are flanked by targeting se- 
quences homologous to the BUD31'HCM1 locus on chromosome III 
of the S. cerevisiae genome. 

In a preferred embodiment of the invention the diploid cells used for 
10 transformation of the excised recombination cassette are auxotro- 
phic for at least two nutritional factors and resistant to at least two 
antibiotics. Preferably, the diploid cells are homozygous for the ura3- 
1 allele and the trpl-l allele, which renders the cells auxotrophic for 
uracil and tryptophan, respectively. Furthennore it is preferred that 
15 the diploid cells used for transfomiation are homozygous for the 
canl'100 allele and the cyh2R allele, which renders them resistant 
to canavanine and cycloheximide, respectively. 

In particular it is preferred, that diploid cells of the S. cerevisiae 
strain MXY47 are used for transfonmation, which are homozygous for 

20 the alleles ura3-1, trpl-t canl-WO and cyh2R and heterozygous for 
the msh2::KanMX mutation. When diploid cells of the strain MXY47 
are used for the transformation with the excised first or second frag- 
ments bearing recombination cassettes and their flanking targeting 
sequences, then transformants obtained can be sporulated to yield 

25 haploid wild type or msh2 segregants that bear the respective re- 
combination cassette. 
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According to the invention it may be preferred to use S. cerevisiae 
cells which have a functional mismatch repair systenn for the inven- 
tive process. The mismatch repair system belongs to the largest con- 
tributors to avoidance of mutations due to DNA polymerise errors in 
5 replication. Mismatch repair also promotes genetic stability by edit- 
ing the fidelity of genetic recombination. It is known that, therefore, 
the mismatch repair machinery has a somewhat inhibitory effect on 
recombination between diverged sequence. However, in a normal S. 
cerevisiae diploid another aspect of mismatch repair, temned mis- 

10 match repair-dependent recombination, was detected (Borts and 
Haber, Science, 237 (1987), 1459-1465). It is thought that the mis- 
match repair of widely spaced mismatches such as in diverged se- 
quences leads to new double-strand breaks that can In turn stimu- 
late a second round of (mismatch repair-dependent) recombination. 

15 In certain circumstances. In particular, when H: is known that the two 
recombination substrates used have widely spaced base differences, 
it is therefore useful to employ S. cerevisiae cells with a functional 
mismatch repair system for conducting the inventive process. 

In another preferred embodiment of the invention, S. cerevisiae cells 
20 that are deficient in the mismatch repair system are used. In S. cere- 
visiae several genes have been identified whose products share 
homology with bacterial mismatch repair proteins, including six 
homologues of the MutS protein, i.e. Msh1, Msh2p, Msh3p, Msh4, 
MshS and Msh6p, and four homologues of the MutL protein, i.e. 
25 Mlhip, Mlh2p, Mlh3p, and Pms1. It is known that in particular the 
PMS1 and MSH2 genes set up a barrier to the recombination of di- 
verged sequences. Therefore, in msh2 and pms1 mutants, meiotic 
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recombination between diverged sequences is increased, relative to 
the frequency of recombination in wild type cells. 

In the context of the present invention the term "deficient in the mis- 
match repair system'' means that the mismatch repair system (MMR) 
5 of a cell Is transiently or pemianently impaired. MMR deficiency of a 
cell or an organism can be achieved by any strategy that transiently 
or permanently impairs the mismatch repair including but not limited 
to a mutation of one or more genes involved In mismatch repair, 
treatment with an agent like UV light, which results in a global im- 

10 pairment of MMR, treatment with an agent like 2-aminopurin or a 
heteroduplex containing an excessive amount of mismatches to 
transiently saturate and inactivate the MMR system and inducible 
expression or repression of one or more genes involved in the mis- 
match repair, for example via regulatable promoters, which would 

15 allow for transient inactivation, i.e. during meiosis, but not during 
vegetative growth. 

In a preferred embodiment of the invention the mismatch repair defi- 
ciency of the S. cerevisiae cell is due to a mutation of at least one 
gene involved in the MMR. In a preferred embodiment the S. cere- 
20 visiae cells are deficient in the MSH2 gene. Preferably, diploid cells 
are homozygous for the msh2 allele, in which the MSH2 coding se- 
quences are replaced by the KanMX construct. 

In the context of the present invention the tenm "recombination cas- 
sette" refers to a DNA sequence comprising at least one recombina- 
26 tion substrate or one DNA sequence to be recombined, which is 
flanked by at least two different marker sequences. The first and the 
second recombination cassette differ in the DNA sequences to be 
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recombined and in the flanking marker sequences, such that any 
pair of recombination cassettes comprises two different DNA se- 
quences to be recombined and at least four different flanking marker 
sequences. 

5 in a preferred embodiment of the invention both the first and the 
second recombination cassettes are generated by inserting the re- 
spective DNA sequences to be recombined between two marker se- 
quences that are closely located on a cloning vehicle and which in 
turn are surrounded by targeting sequences that are homologous to 

10 a defined locus of the S. cerevisiae genome. The targeting se- 
quences therefore can direct the integration of an excised fragment 
containing a recombination cassette into this defined locus. The in- 
sertion of the DNA to be recombined between the two marker se- 
quences is preferably effected by genetic engineering methods. In a 

15 preferred embodiment of the invention the two marker sequences in 
the cloning vehicle are flanked by targeting sequences which are 
homologous to the BUD31-HCM1 locus on chromosome III of the S. 
cerevisiae genome. Therefore, the targeting sequences direct the 
integration of the excised fragments containing a recombination cas- 

20 settes into that locus. 

In the context of the present invention the terms "DNA sequences to 
be recombined" and "recombination substrate" mean any two DNA 
sequences that can be recombined as a result of meiotic recombina- 
tion processes, whereby recombination between these sequences 
25 can be due to homologous or non-homologous recombination. 

Homologous recombination events of several types are character- 
ized by the base pairing of a damaged DNA strand with a homolo- 
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gous partner, where the extent of interaction can involve hundreds of 
nearly perfectly matched base pairs. The term "^homology" denotes 
the degree of identity existing between the sequence of two nucleic 
add molecules, in contrast, illegitimate or non-homologous recombi- 
5 nation is characterized by the joining of ends of DNA that share no 
or only a few complementary base pairs. In yeast, non-honnologous 
repair and recombination events occur at significantly lower frequen- 
cies than homologous recombination events. 

The first and second DNA sequences to t>e recombined are diverg- 
ing sequences, i.e. sequences, which are not identical but show a 
certain degree of homology. This means, that the DNA sequences to 
be recombined diverge by at least one nucleotide. Preferably the 
DNA sequences to be recombined are sequences that share at least 
one or more homologous regions, which can be very short The ho- 
mologous regions should comprise at least 5-10 nucleotides, pref- 
erably more than 20-30 nucleotides, more preferred more than 30-40 
nucleotides and most preferred more than 50 nucleotides. In a pre- 
ferred embodiment of the invention the first and the second DNA 
sequences to be recombined diverge by at least one nucleotide, in 
particular more than 0,1 %, preferably more than 5 % to more than 
50 %. This means, that the first and second DNA sequences to be 
recombined can also diverge by 55%, 60%, 65 % or even more. 

Recombination substrates or DNA sequences to be recombined can 
have a natural or synthetic origin. DNA sequences to be recombined 
25 therefore can be derived from any natural source including viruses, 
bacteria, fungi including S. cerevisiae, animals, plants and humans. 
In a preferred embodiment of the invention the first and the second 
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DNA sequences to be recombined are derived from organisms other 
than S. cerevisiae. 

In a preferred embodiment of the invention DNA sequences to b© 
recombined are protein-encoding sequences, for example se- 
5 quences encoding enzymes, which can be utilized for the industrial 
production of natural and non-natural compounds. Enzymes or those 
compounds produced by the help of enzymes can be used for the 
production of drugs, cosmetics, foodstuffs, etc. Protein-encoding se- 
quences can also be sequences, \Arhich encode proteins, that have 

10 therapeutic applications in the fields of human and animal health. 
Important classes of medically important proteins include cytoWnes 
and growth factors. The recombination of protein coding sequences 
allows for the generation of new mutated sequences which code for 
proteins with altered, preferably improved functions and/or newly 

15 acquired functions. In this way it Is possible, for example, to achieve 
improvements in the themaostability of a protein, to change the sub- 
strate specificity of a protein, to improve Its activity, to evolve new 
catalytic sites and/or to fuse domains from two different enzymes. 
Protein coding DNA sequences to be recombined can Include se- 

20 quences from different species which code for the same or similar 
proteins that have in their natural context similar or identical flino 
tions. Protein coding DNA sequences to be recombined can include 
sequences from the same protein or enzyme family. Protein coding 
sequences to be recombined can also be sequences which code for 

25 proteins with different functions - for example, sequences that code 
for enzymes which catalyse different steps of a given metabolic 
pathway. In a preferred embodiment of the invention the first and the 



wo 2005/075654 



22 



PCT/EP2005/000841 



second DNA sequences to be recombined are selected from the 
group of gene sequences of the Oxa superfamily of B-lactamases. 

In another preferred embodiment of the Invention DNA sequences to 
be recombined are non-coding sequences such as sequences, 
5 which, for example, are involved \Anthin their natural cellular context 
in the regulation of the expression of a protein-coding sequence. 
Examples for non-coding sequences include but are not limited to 
promoter sequences, sequences containing ribosome binding sites, 
intron sequences, polyadenylation sequences etc. By recomblning 
10 such non-coding sequences it is possible to evolve mutated se- 
quences, which in a cellular environment result in an altered regula- 
tion of a cellular process - for example, an altered expression of a 
gene. 

According to the invention a recombination substrate or DNA se- 
15 quence to be recombined can of course comprise more than one 
protein coding sequence and/or more than one non-coding se- 
quence. For example a recombination substrate can comprise one 
protein coding sequence plus one non-coding sequence or a combi- 
nation of different protein coding sequences and d'rfferent non- 
20 coding sequences. In another embodiment of the invention DNA se- 
quences to be recombined therefore can consist of one or more 
stretches of coding sequences with intervening and/or flanking non- 
coding sequences. That means, the DNA sequence to be recom- 
bined can be for example a gene sequence with regulatory se- 
26 quences at Its 5'-terminus and/or an untranslated 3'-region or an 
mammalian gene sequence with an exon/lntron structure. In still an- 
other embodiment of the invention DNA sequences to be recom- 
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bined can consist of larger continuous stretches that contain more 
than a single coding sequence with intervening non-coding se- 
quences, such as those that as may belong to a bios/nthetio path- 
way or an operon. DNA sequences to be recombined can be se- 
5 quences, which have already experienced one or more recombina- 
tion events, for example homologous and/or non-homologous re- 
combination events. 

The recombination substrates can comprise non-mutated wild-type 
DNA sequences and/or mutated DNA sequences. In a preferred em- 
10 bodiment therefore it is possible to recombine wild-type sequences 
with already existing mutated sequences in order to evolve new mu- 
tated sequences. 

In the context of the present invention the term "marker sequences" 
refers to unique DNA sequences that are positioned upstream or 

16 downstream of a recombination substrate or an already recombined 
DNA sequence in Saccharomyces cerevisiae cells. The prince of 
a marker sequence on the same molecule of DNA as the recombina- 
tion substrate or already recombined DNA sequence, preferably in 
combination with another marker sequence positioned on the other 

20 side of the recombination substrate, allows that recombination sub- 
strate or already recombined DNA sequence to be recognized and 
selected for, whether by molecular or genetic methods. Therefore, in 
one preferred embodiment of the invention there must be one or 
more marker sequences upstream of each recombination substrate 

25 and one or more marker sequences downstream of each recombina- 
tion substrate, such that in a cell heterozygous for two different re- 
combination substrates, there are at least four different marker se- 
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quences altogether. This arrangement allows for the selection of 
crossovers Involving recombination substrates. It also allows further 
rounds of recombination to be canied out in a Iterative fashion. In 
another preferred embodiment of the invention more than one 
5 marker can be situated on each side of the recombination substrate. 
For example, additional markers can be introduced to increase the 
stringency of selection. 

Marker sequences may comprise protein-encoding or non-coding 
DNA sequences. In a preferred embodiment of the invention the pro- 
tein-encoding marker sequences are selected from the group con- 
sisting of nutritional markers, pigment markers, antibiotic resistance 
markers, antibiotic sensitivity markers and sequences that encode 
different subunits of an enzyme, which functions only, if both or more 
subunits are expressed in the same cell. In a further preferred em- 
bodiment of the invention the molecular non-coding marker se- 
quences include but are not limited to primer recognition sites, i.e. 
sequences to vi^ich PGR primers anneal and which allow an amplifi- 
cation of recombinants, intron/exon boundaries, promoter se- 
quences, downstream regulated gene sequences or restriction en- 
zyme sites. 

A "nutritional marker" is a marker sequence that encodes a gene 
product that can compensate an auxotrophy of an organism or cell 
and thus can confer prototrophy on that auxotrophic organism or 
cell. In the context of the present invention the term "auxotrophy" 
means that an organism or cell must be grown in a medium contain- 
ing an essential nutrient which cannot be synthesized by the 
auxotrophic organism itself. The gene product of the nutritional 
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marker gene promotes the synthesis of this essential nutrient miss- 
ing in the auxotrophic cell. Therefore, upon expression of the nutri- 
tional marker gene it is not necessary to add this essential nutrient to 
the medium in which the organism or cell is grown, since the organ- 
5 ism or cell has acquired prototrophy. 

A "pigment marker" is a marker gene wherein the gene product is 
involved in the synthesis of a pigment which upon expression will 
stain that cell, in which the pigment marker is expressed. A cell with- 
out the pigment marker does not synthesize the pigment and is 
10 therefore not stained. The pigment marker therefore allows a rapid 
phenotypical detection of that cell containing the pigment marker. 

An "antibiotic resistance marker" is a marker gene wherein the gene 
product confers upon expression to a cell, in which the expression of 
the antibiotic marker gene takes place, the ability to grow in the 
15 presence of a given antibiotic at a given concentration, whereas a 
cell without the antibiotic resistance marker cannot. 

An "antibiotic sensitivity marker" is a marker gene wherein the gene 
product destroys upon expression the ability of a cell to grow in the 
presence of a given antibiotic at a given concentration. 

20 In a preferred embodiment of the invention each of the gene prod- 
ucts of the first and third marker sequences can compensate an 
auxotrophy of a S. cerevisiae cell. Preferably, the first marker se- 
quence is URA3, the gene product of which can confer uracil proto- 
trophy to a uracil auxotrophic S. cerevisiae cell. Preferably, the third 

25 marker sequence is TRP1, the gene product of which can confer 
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tryptophan prototrophy to an tryptophan auxotrophic S. cerevisiae 
cell. 

In another preferred embodiment of the invention the gene products 
of the second and fourth marl<er sequences confer sensitivity to an 
5 antibiotic to a S. cerevisiae cell which is resistant to that antibiotic. 
Preferably, the second marker sequence is CAN1, the gene product 
of which can confer to a canavanlne resistant S. cerevisiae cell sen- 
sitivity to canavanine. Preferably, the fourth marker sequence is 
CYH2, the gene product of which can confer to a qrcloheximide re- 
10 sistant S. cerevisiae cell sensitivity to cycloheximide. 

In another prefenred embodimant of the invention the marker se- 
quences comprise annealing sites for PGR primers. Preferably, the 
first, second, third and fourth marker sequences are recognized by 
the primers KNS11, KNS28, KNS16, and KNS29. 

15 In a preferred embodiment of the Inventive process haploid cells 
containing recombination cassettes with either first, second, third or 
fourth recombined DNA sequences can be identified by PGR proc- 
esses in order to detect the presence of the respective marker com- 
bination. 

20 In another preferred embodiment of the inventive process haploid 
cells containing recombination cassettes with either first, second, 
third or fourth recombined DNA sequences are identified by plating 
the haploid cells on media that select for the presence on the same 
DNA molecule of the respective marker combination. This means 

25 that haploid cells containing first recombined DNA sequences are 
plated on a medium that selects for the presence of the first and the 
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fourth marker sequences. Haploid cells containing second recom- 
bined DNA sequences are plated on a medium that selects for the 
presence of the second and the third marker sequences. Haploid 
cells containing third recomblned DNA sequences are plated on a 
5 * medium that selects for the presence of the first and the second 
marker sequences. Haploid cells containing fourth recombined DNA 
sequences are plated on a medium that selects for the presence of 
the third and the fourth marker sequences. 

Another aspect of the present Invention relates to a process of gen- 
10 erating novel proteins, enzymes, pathways and non-coding se- 
quences with novel or improved functions and properties, whereby 
known protein-coding sequences or known non-coding sequences 
are subjected one or more recombination rounds by using the inven- 
tive process for generating and detecting recombinant DNA se- 
15 quences in S. cerevislae. 

Another aspect of the present invention relates to plasmid pMXY9. 
Plasmid pMXY9 comprises the URA3 marker gene and the CAN1 
marker gene, which are located adjacently. Between the two marker 
gene a polylinker sequence, comprising several restriction sites for 

20 inserting a DNA sequence to be recombined, is arranged. The two 
markers are flanked by targeting sequences homologous to the 
BUD31-HCM1 locus on chromosome III of the S. cerevisiae genome. 
The polylinker sequence between the two marker genes cornprises 
restriction sites for the restriction enzymes Smal, Xbal Bglll and 

25 Pad. 



Another aspect of the present invention relates to plasmid pMXY12. 
Plasmid pMXY12 comprises the TRP1 marker gene and the CYH2 
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marker gene. Between the two nnarker genes a polylinker sequence 
comprising several restriction sites for inserting a DNA sequence to 
be recombined is arranged. The two markers are flanked by target- 
ing sequences homologous to the BUD31-HCM1 locus on chromo- 
5 some III of the S. cerevisiae genome. The polylinker sequence com- 
prises restriction sites for the restriction enzymes Spel, Smal and 
Pad. 

The present invention relates also to the S. cerevisiae strain MXY47, 
characterized in that diploid cells thereof are homozygous for the 
10 alleles ura3-1, trp1-1, cah1-100 and cyh2R and heterozygous for the 
msh2::KanMX mutation. 

The present invention also relates to the E. coli strain JM101, con- 
taining plasmid pMXY9, and to E. coli strainDHSa, containing plas- 
mid pMXY12. 

15 Plasmids pMXY9 and pMXY12 and the Saccharomyces cerevisiae 
strain MXY47 were deposited on the 3'^ of January 2005 at the 
DSMZ (Deutsche Sammlung fur Mikroorganismen und Zellkulturen 
GmbH, Mascherodenweg lb, 38124 Braunschweig, Germany) under 
accession numbers DSM 1701 0, DSM 17011, and DSM 17026, re- 

20 spectlvely. 

Another aspect of the present invention relates to a kit which can be 
used for conducting the inventive process for generating and detect- 
ing recombined DNA sequences in Saccharomyces cerevisiae. In a 
first embodiment the kit comprises at least a first container which 
25 contains cells of S. cerevisiae strain MXY47, a second container 
which contains cells of E. coli strain JM101 bearing plasmid pMXY9 
and a third container containing cells of E. coli strain DH5a bearing 
plasmid pMXY1 2. 

In a second embodiment the kit comprises at least a first container 
30 containing cells of S. cerevisiae strain l\/IXY47, a second container 
containing DNA of plasmid pMXY9 and a third container containing 
DNA of plasmid pMXY12. 
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The present invention is illustrated by the following sequence listing, 
figures and example. 

Figure 1 shows a schematic of the selection system for the selection 
of recombinants on defined media. Diploid parental celts heterozy- 
5 gous for recombination cassettes - here, recombination substrate A, 
flanked by the URA3 and CAN1 genes, and recombination substrate 
B, flanked by the TBP1 and CYH2 genes - are induced to undergo 
meiosis. Spores are plated on medium lacking uracil and containing 
canavanine (-Ura+Can) and on medium lacking tryptophan and con- 

10 taining cycloheximide (-Trp+Cyh) to select for recombinant cells 3 
and 4, in which a crossover involving the recombination substrates A 
and B has taken place, as indicated by (+). Parental diploids and 
non-recombinant haploids 1 and 2 cannot grow on either of these 
media, as indicated by (-). A subsequent round of meiosis may use 

16 recombinants 3 and 4 to construct a new diploid, which when 
sporulated yields new recombinant cells bearing the same flanking 
marker configurations as those shown in cells 1 and 2. Recombinant 
spore colonies with these configurations can be selected on medium 
lacking uracil and containing cycloheximide (-Ura+Cyh), and on me- 

20 dium lacking tryptophan and containing canavanine(-Trp+Can), re- 
spectively. 

Figure 2 shows the plasmids pMXY9 and pMXY12 (above), which 
are vectors used for the targeting of recombination cassettes to the 
BUD31-HCM1 locus on chromosome III of the yeast genome. Both 
26 plasmids bear sequences homologous to this locus (indicated as 6* 
and 3*), which flank the URA3 and CAN1 markers (pMXY9) or TRP1 
and CYH2 (pl\/IXY12) markers. A short sequence bearing restriction 
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Sites that allow for cloning recombination substrates is located be- 
tween each pair of marker sequences. Below, integration of recom- 
bination cassettes into the BUD31-HCM1 locus. A pMXY9 derivative 
bearing recombination substrate A is digested with Nott to liberate 
5 the recombination cassette flanked by 5' and 3' targeting sequences 
and digestion products are transformed Into MXY47 cells. Ura+ de- 
rivatives that contain a correctly targeted insert are identified for 
subsequent use in constructing strains heterozygous for recombina- 
tion cassettes. Recombination cassettes bearing the TRP1 and 
10 CYH2 markers are similarly constructed in pMXY12 and trans- 
formed into MXY47, followed by selection for tryptophan prototrophy. 

Figure 3 shows the frequency of recombination between Oxa genes 
as a function of sequence identity in wild type and ms/72 strains. 
15 Above^ the mean ± standard deviation of (n) Independent experi- 
ments is provided. Below, graphical representation of these data. 
The following strains were used: MXY60, MXY62, MXY64, MXY66, 
MXY99, and MXY102. 

Figure 4 shows the msh2 hyper-recombination effect. A msh2hN[ ra- 
20 tio was calculated for each independent experiment (total number = 
n) for pairs of strains with the given percent of shared Oxa homology 
and for each selection condition, and the the mean ± standard devia- 
tion of these summed values are shown. The data are represented 
graphically below. The following pairs of strains were used: MXY60 
25 and MXY62, MXY64 and MXY66, MXY99 and MXY1 02. 

Figure 5 shows a PGR analysis of recombination between Oxa se- 
quences sharing 78% homology. Spore colonies were derived from 
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wild type (MXY99) and msh2 (MXY102) diploids by selection on me- 
dium lacking uracil and containing canavanine, or on medium lacking 
tryptophan and containing cycloheximide. Colony PGR was per- 
formed on selected spore colonies that exhibited phenotypes consis- 
5 tent with those expected for crossover recombinants. Above, two 
reactions were carried out for each wild type and msh2 Ura+CanR 
candidate, one with a parental-specific primer pair (KNS16 + KNS28, 
products shown in the first of each pair of lanes for each candidate), 
and the other with a recombinant-specific primer pair (KNS16 + 

10 KNS29, second lane). Below, similar reactions were carried out for 
each wild type and msh2 Trp+CyhR candidate, one with a parental- 
specific primer pair (KNS1 1 + KNS29, first lane), and the other v\^th a 
recombinant-specific primer pair (KNS11 KNS28, second lane). 
Control reactions were carried out on appropriate genomic DNA 

15 templates containing known configurations of flanking marker se- 
quences, either parental (P) or recombinant (R). (-) no DMA control. 

Figure 6 shows the frequencies of recombination for second^ound 
recombination. Wild type and msh2 haploidls obtained after a first 
round of recombination with MXY64 and MXY66 were mated to pro- 

20 duce wild type (MXY81, MXY82 and MXY83) and msh2 (MXY86, 
MXY87, and MXY88) diploids with mosaic OxaZ-Oxal 1 recombina- 
tion cassettes. Wild type (MXY90) and msb2 (MXY92) diploids ho- 
mozygous for the Oxa11 recombination substrate were also con- 
stmcted from recombinant progeny of MXYSO and MXY62. All dip- 

25 loids were sporulated and spores were plated on media to select for 
Ura+CanR and Trp+CyhR recombinants. 
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The sequence listing comprises the following sequences: 

SEQ ID No. 1 and 2 show the sequences of the primers MSH2UP 
and MSH2DN, respectively, for the amplification of MSH2. 

SEQ ID No. 3 to SEQ ID No. 6 show the sequences of the primers 
5 MSH2A1, MSH2A2, MSH2A3 and MSH2A4, respectively, which are 
MSH2-specific analytical primers. 

SEQ ID No. 7 and SEQ ID No. 8 show the sequences of the primers 
kZKANMX and K3KANMX, respectively, which are KanMX-specmc 
analytical primers. 

10 SEQ ID No. 9 and SEQ ID No. 10 show the sequences of the prim- 
ers LEU2UP and LEU2DN, respectively, which are used for the am- 
plification of LEl/2. 

SEQ ID No. 1 1 and SEQ ID No. 12 show the sequences of the prim- 
ers HiSSUP and HIS3DN, respectively, which are used for the ampii- 
15 fication of H/S3. 

SEQ ID No. 13 and SEQ ID No, 14 show the sequences of the prim- 
ers KNS1 and KNS2, respectively, which are used for the amplifica- 
tion of the 3' targeting sequence. 

SEQ ID No. 15 to SEQ ID No. 17 show the sequences of the primers 
20 KNS3, KNS4 and KNS6, respectively, which are used for the amplifi- 
cation of a 5' targeting sequence. 
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SEQ ID No. 18 and SEQ ID No. 19 show the sequences of the prim-' 
ers KNS7 and KNS8, respectively, which are used for the amplifica- 
tion of Oxa7. 

SEQ ID No. 20 and SEQ ID No. 21 show the sequences of the prim- 
5 ers KNS9 and KNS10, respectively, which are used for the amplifica- 
tion of Oxall. 

SEQ ID No. 22 shows the sequence of the primer KNS12, which is a 
BUD31 downstream analytical primer. 

SEQ ID No. 23 shows the sequence of the primer KNS13, which is a 
10 BUD31 upstream analytical primer. 

SEQ ID No. 24 shows the sequence of the primer KNS14, which is a 
TRPf-specific analytical primer. 

SEQ ID No. 25 shows the sequence of the primer KNS15, which is a 
L//?/l3-specific analytical primer. 

15 SEQ ID No. 26 and SEQ ID No. 27 show the sequences of the prim- 
ers KNS17 and KNS18, respectively, which are used for the amplifi- 
cation of CYH2. 

SEQ ID No. 28 shows the sequence of the primer KNS30, which is a 
TRPf-specific forward primer used as sequencing prinner. 

20 SEQ ID No. 29 shows the sequence of the primer KNS31, which is a 
C/lA/f-specific reverse primer used as sequencing primer. 
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SEQ ID No. 30 shows the sequence of the primer KNS33, which is a 
CyH2-specific reverse primer used as sequencing primer. 

SEQ ID No. 31 and SEQ ID No. 32 show the sequences of the prim- 
ers KNS36 and KNS37, respectively, which are used for the amplifi- 
5 cation of Oxa5. 

SEQ ID No. 33 shows the sequence of the primer KNS38, which is a 
l/RA3-speciflc forward primer used as sequencing primer. 

Example ■ Generation of mosaic genes in Saccharomvces cere- 
visiae mismatch repair mutants 

10 1 . IMaterials and methods 

1.1 Media 

Standard rich medium YPD (BiolOI) was used for routine growth, 
and synthetic dropout media (Bio101) were used to monitor genetic 
markers and for selection of recombinants. For sporulation, cells we- 

15 re precultured overnight in SPS (50 mM potassium phthalate, pH 
5.0, 0.5% yeast extract (Dlfco), 1% Bactopeptone (Difco), 0.17% 
yeast nitrogen base, 1% potassium acetate, 0.5% ammonium sui^- 
te) plus required nutritional supplements, washed, resuepended in 
1% potassium acetate plus supplements and incubated with shaking 

20 for two days. All manipulations were carried out at 30^C. For tetrad 
analysis, asci were digested with Helix pomatia B-glucuronidase 
(Sigma) and dissected using a Nikon Eclipse E400 microscope fitted 
with a TDM400 micromanipulator (Micro Video Instruments, Inc.). 
Other genetic methods were conducted as described by Ausubel et 
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al. Current Protocols in Molecular Biology (1998), John Wiley and 
Sons, Inc., New York. All yeast transformations were performed u- 
sing the LiAc method according to Agatep et al.. Technical Tips On- 
line (http://tto.trends.com). 

5 1.2. Yeast strains 

All yeast strains used or created in this study are listed in Table 1 
and Table 2. All yeast strains are isogenic derivatives of the readily 
spooilating W303 background. The diploid MXY47, which serves as 
a host for transformation with recombination cassettes, was con- 

10 structed by transformation and genetic crosses as follows. The hap- 
loid D184-1B (a gift of S.-Gangloff, CEA, France) was transformed 
with a LEU2 fragment (obtained by preparatory PGR of the V\/303 
strain U474 with the primer pair LEU2UP/LEU2DN which are listed in 
the sequence listing to yield the Leu+ haploid MXY13; The haploid 

15 D184-1C (a gift of S. Gangloff) was transfomied with a HIS3 frag- 
ment (obtained by preparatory PCR of ORD4369-25D with the 
primer pair HIS3UP/HIS3DN) to yield the His+ haploid MXY25. The 
haploids MXY18 and MXY22 are recessive cycloheximide-resistant 
{cyh2R) derivatives of D184-1B and D184-1C, respectively, selected 

20 on 10 )xg/ml cycioheximide; the presence of mutations mapping to 
the CYH2 locus that confer cycioheximide resistance was confimrted 
by sequencing (two different nucleotide alterations resulting in a 
change of glutamine 38 to lysine) and segregation analysis. MXY18 
and MXY25 were crossed to obtain the diploid MXY29; MXY13 and 

25 MXY22 were crossed to obtain the diploid MXYSS. The haploid seg- 
regants MXY29-6D and MXY33-8C were crossed to obtain MXY38, 
which is heterozygous for the Ieu2-3,112 and his3-11,15 markers 
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and homozygous for the cyh2R mutation. MXY38 was transformed 
with the msh2::KanMX cassette amplified by PGR from RBT348 (a 
gift of R. Borts, University of Leicester) with the primers MSH2UP 
and MSH2DN to yield MXY47. Transformants were selected on 200 
5 |ig/ml G418 (Invitrogen) and confirmed by colony PGR (see below) 
with the primers l\/ISH2A1, MSH2A2, MSH2A3 and IVISH2A4 and by 
tetrad analysis, i.e. analysis of the four spores, to confirm mariner 
segregation. 

10 Table 1. Haploid yeast strains 



Name 


Genotype 


Source or derivation 


D184-1B 


a uraS-l trp1-1 can1-100N^1T,15leu2' 
3,112 adeZ'l 


S. Gangloff 


D184-1C 


alpha ura3-1 trpl-l canl-100 hrs3'11,15 
Ieu2'3,112ade2'1 


S. Gangloff 


U474 


Gipha um3-1 trpl-l canl-^lOO hFs3-11,1S 
ad02'1 


S. Gangloff 


ORD4369- 
25D 






MXYIS 


a uraS-l trpl-l canl-100 1^^3-11,15 
ade2'1 


D184-1B transformed 
with LEU2 PGR prod- 
uct 


MXVIS 


a ura3 -1 trpl-l canl-lOO cyhSR his3- 
11,15 Ieu2-^,112ade2'1 


D184t1B cyhR deriva- 
tive 


MXY22 


alpha uraS-l trpl-l canl-100 cytw2R hisZ- 
11,15 Ieu2'3,112ade2'1 


D1 84^10 cyhR derfva- 
Uve 


WIXY25 


alpha ura3-1 trpl-l canl-lOO Ieu2'3,112 
ade2-1 


D184-1C transfomned 
with HIS3 PGR prod- 
uct 


MXY29- 
60 


alpha ura3'1 trpl-1 canl-100 cyh^R Ieu2- 
3,112 ade2-1 


MXY29 segregant 
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MXy33- 
8C 


a ura3-1 trpl-l can1-1(X) cyh2R hts3- 
11,15 atto2-1 


MXY33 segregant 


MXY50- 
3D 


alpha m^2::KanMX uraZ-1 trp1-1 cyh2R 
canl-100 leu2-3,112 wto2-1 
BUD31::URA3-CAN1 


MXY50 segregant 


MXY50- 
7D 


alpha ura3-1 trp1-1 cyh2R can1-100 /ife^- 
11,15 ade2-1 BUD31::URA3-CAN1 


MXY50 segregant 


WDCVSI- 
2B 


alpha msh2::KanMX ura3-1 trp1-1 cyh2R 
canl-100 Ns3-11,15 ada2-1 
BUD31::URM-Oxa7'CAN1 


MXY51 segregant 


Mxysi- 

10C 


alpha ufaa-l trpl-l cyh2R csnl-lOO Ieu2' 
3,112 ade2'1 BUD31::URA3-Oxa7-CAN1 


MXY51 segregant 


2A 


a/pfta mshZ-KanMK ura3-1 trp1-1 cyh^ 
canl-100 Ns3-11,15 afle2-1 
BUD31::URA3-0xa11-CAN1 


MXY52 segregant 


MXY52- 
7D 


alpha ura3-1 trp1-1 cyh2R canl-100 Ieu2- 
3,112 adeZ-l BUD31::URA3-Oxa11- 
CAN1 


MXY52 segregant 


MXYSa- 
11C 


a ura3-1 trp1-1 cyhZR canl-100 Ieu2- 
3,112 ad92-1 BUD31::TRP1-CYH2 


MXY53 segregant 


MXY53- 
11D 


a mshZ"KanMX ura3-1 trp1-1 canl-100 
cyh2R h^ll.iS ade2-1 BUD31::TRP1- 
CYH2 


MXY53 segregant 


WDCY55- 
1C 


a mshZ-:KanMX urai3-1 trpl-l canl-100 
cyh2R his3-11,15 ade2-1 BUD31::TRP1- 
OxaYf-CYH2 


MXY5S segregant 


MXY55- 
2B 


a ura^i trpl-l canl-100 cyhZR his3- 
11,15 ade2-1 BUDai-TRPI-Oxall- 
CYH2 


MXY55 segregant 


MXY55- 
13D 


a msh2::KanMX uraS-l trpl-l canl-100 
cyfiZR Ieu2'<3,112 ade2-1 BUD31::TRP1' 
Oxa11'CYH2 


MXY55 segregant 
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MXY79- 
3B 


aflsha ura3-1 trp1-1 ceai1-1(Kio^2R his3- 
11,15 leU2'3.112 adeZ-l BUD31::L/RA3- 
OxaS^ANI 


MXY79 segregant 


MXY79- 
9A 


alpha msh2::Kan MX ura3-1 trp1-1 canl- 
100 cyh2R leu 2-3,112 acie2-1 
BUD31::URA3-Oxa5-CAN1 


MXY79 segregant 


RBT348 


alpha msh2::KanMX ura3 cyhR m&t13-4 
Iys2-d 


R. Bolts 



Table 2. Diploid yeast strains. 



Name 


Genotype 


Source or derivation 


MXY29 


a/a!pha uraS-iy" trpl-l^ cyh2R/CYH2 
can1'100r ^53-11,15^133 Ieu2'^,-112r 
ade2'1/" 


MXY18XMXY25 


MXY33 


a/alpha uraS-l/" trp1-1/" cyh2R/aYH2 
canl'lOO/' his3-11,isr' Ieu2'3,112/LMU2 
a(lG2-ir 


MXY22XMXY13 


MXY38 


a/alpha ura3'1/' trpl-1/' cyh2R/" <^an1- 
100/" his3-11,15/HIS3 leu2'3,112/L^U2 
ade2'1f 


MXY33-8CXMXY29-6C 


MXY47 


a/alpha mshZ':KanMX/MSH2 ura^ir 
trpl-ir cyh2Rr canl-IOOT /ife3- 
11, 15/HfS3 Ieu2'3, 112AMU2 ade2'ir 


MXras transfomned 
wrth msh2::KanMX 
PGR product 


MXY50 


a/alpha msh2::KanMX/MSH2 ura:3-1/" 
trpl'ir cyh2Rr canl-IOOr tils3- 
1l15/HiS3 leu2'3,112d£U2 ade^-lf 
BUD31::URA3-CAN1/BUD31 


MXY47 transformed 
with iVoM-digested 
pMXY9 


MXY51 


a/aipha msh2::KanMX/MSH2 ura:3-1/" 
trpl'ir cyh2Rr canl-IOOf fiisS- 
1115/HIS3 leu2'3,112AEU2 adeZ-ir 
BUD31::URA30xa7'CANimUD31 


MXY47 transfomied 
with A/oM -digested 
pltfXY13 


MXY52 


a/alpha msh2::KanND(/MSH2 uraS/" trpl-^ 
ir cyh2Rf' canl-IOOT' hls3-11,1S/HIS3 
Ieu2-3,112AEU2 ade2-ir BUD31::URA3- 
Oxall'CANI/BUDSI 


i\/IXY47 transfomied 
with A/ofl-digested 
plWXYU 


MXYSa 


a/alpha mshZ':KanMX/MSH2 ura3-ir 
trpl'ir cyh2Rr canl'IOOf tJisS- 
11,15/H!S3 Ieu2''3,112/LEU2 adeS-ir 
BUD31:: TRP1-CYH2/BUD31 


i\/IXY47 transfomied 
with AtoM -digested 
PIWXY12 


MXYSS 


a/alpha msh2::KanMX/" MSH2 ura3'1/" 
trpl'ir cyh2R/'canU100/" /^te3- 
11,15/HIS3 leu2-3,112/L£U2 adeZ-ir 
BUD31::TRP1''Oxa11'CYH2/BUD31 


IWXY47 transfonmed 
with Afofl -digested 
plWXY22 
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MXY57 


a/alpha msh2::KanMX/" ura3-ir trp1-ir 
cyh2Rr can1-10Qr his3-11,1S/HIS3 Ieu2- 
3,112/LEU2 adeZ-ir BUD31::TRP1- 


MXY50-3D X MXY53- 
11D 


MXY59 


a/a!pha ura3-1/" trpl-l/" cyhZR/" canl- 
100/' his3-11,15/HIS3 Ieu2-3,112/LEU2 
ade2'ir BUD31::TRP1-CYH2/ 


MXY50-7D X MXY53- 
11C 


MXY60 


a/alpha ura3-1/" trp1-1/" cyh2R/" canl- 
lOOr his3-11.15/HIS3 Ieu2-3,112A£U2 
ade2-1/" BUD31::TRP1-Oxa11- 


MXY52-7D X MDCYSS- 
2B 


mATOZ 


cyh2Rr canl-IOOrNsa-mS/HISa leu2' 
3,112AJEU2 ade2'1/" BUD31::TRP1' 
Oxa11'CYH2mUD31::URA3'Oxa11' 


1C 


IVIATD4 


et/alnhsi ttna^^iJ" fff%^—iA* f^A%OS9J^ f*s»n^— 
a/aipna Ura^lf UfJl~l/ Gyn£f^ Cani~ 

ade2-1/" BUD31::URA3-Oxa7- 
CANimUD31::TRP1-Oxa11-CYH2 


IVIAT91~1Uw X IVIATOO- 


IMXY66 


a/alpha mshZ:KanMX/" ura3-1/" trp1-ir 

3,112/LEU2 ade2-ir BUD31::JRP1- 
Oxa 1 1-CYH2/BUD31::URA3-Oxa7-CAN1 


MXY51-2B X MXY55- 








MXY79 


a/alpha m^2::KanND</MSH2 ura3-1/" 
bp1-1/" c^2Rr canl-IOOr Ns3- 
11,15miS3 Ieu2-3,112/LEU2 ade2-1/" 
BUD31::URA3-Oxa5-CANimUD31 


MXY47 transformed 
with Afttfl-digested 
pMXr24 














MXY99 


a/alpha ura3-1J" trp1-1f cyh2R/" canl- 
lOOr his3-11. ISr teu2-3,112AJEU2 ade2- 
ir BUD31::URA3-OxaS- 
CAN1/BUD31::TRP1-Oxa11-CAN1 


MXY79-3B X MXY55- 
2B 


MXY102 


a/aipha nvsh2::KanMXr ura3-1/" bp1-1/" 
cyh2Rr canl-ioa/banl M$S-11,1S/HIS3 
Ieu2-3,112/LEU2 a(le2-ir BUD31::URA3- 
OxaS-CANimUD31::TRP1-Oxa11-CAN1 


MXY79^9A X MXY55- 
1C 



1.3 Plasmid construction 

The bacterial strains XL1-Blue MRF{AmcrA)183(mcrCB-hsdSMR- 
mrr)173 endA1 supE44 thi-1 recA1 gyrA96 relA1 lac [F' proAB 
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lacl^ZAM15 TnIO (Tef)]) and JM110 {rpsL [Stf] thr leu thi-1 lacY 
galK galT ara tonA tsx dam dcm supE44 Apac-proAB] [F' traD36 
proAB lacl^ZAM15]) were used as hosts for cloning. Standard meth- 
ods were used for piasmid construction (Ausuisel et a!.). All plasmids 
5 used or created in this study are listed in Table 3. Restriction en- 
zymes, T4 DMA ligase and other enzymes used in cloning were pur- 
chased from New England BioLabs. DNA firagments and plasmids 
were purified using kits supplied by Qiagen and l\Aacherey-Nagel. 

Upstream ("5' target") sequences corresponding to the BUD31 locus 

10 were amplified by preparatory PGR from W303 genomic DNA with 
the primer pair KNS3/KNS4 and cloned as a l^nMXho^ fragment 
into /^/}1^/70l -digested pKSil(+) (Stratagene) to create pMXY1; 
downstream ("3* target") targeting sequences were similarly ampli- 
fied with the primer pair KNS1/KNS2 and cloned as a XX>a1/A/oM 

15 fragment into Xbal/A/oM-digested pKSII(+) (Stratagene) to create 
fMXY2. The TRP1 marker was excised from pJH63 (a gift of R. 
Borts) as a BgnUEcoRl fragment and ligated to 6a/nHI/£coRI- 
digested pMXY1 to create pMXY3, and the URA3 marker was ex- 
cised from XTjol/HmDIII-digested pRED316 (a gift of R. Borts) and 

20 ligated to X/70l/H//)DIII-digested pMXYI to create pMXY4. The CAN1 
marker was isolated from pRED316 as a Sma1 fragment and ligated 
to Hpa1 -digested pMXY2 to create pMXY5. The 5' targeting se- 
quences in pMXY3 and pMXY4 were replaced with sequences re- 
amplified from genomic DNA with the primer pair KNS4/KNS6 and 

25 ligated as KpnVXhol fragments into the respective KpnMPOio^- 
digested plasmids to produce pMXY7 and pMXY6. This step was 
undertaken to conrect the absence from the primer KNS3 of restric- 
tion sites required in later phases of cloning. The Kpn]-Sma\ ft-ag- 
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ment of pMXY6 containing the 5* target and the URA3 marker were 
iigated to KpnI/Smal-digested pMXY5 to produce the URA3'CAN1 
recombination cassette vector pMXY9. The KpnVSpel fragment of 
pMXY7 containing the 5' target and TRP1 marker were Iigated to 
5 Kpnl/Spel-digested pMXY2 to produce pMXY11. Finally, the CYH2 
marker was amplified by preparatory PGR from W303 genomic: DNA 
with the primer pair KNS17/KNS18, digested with SamHI and Pvul 
and iigated to Bg/ll/Pad-digested pMXY1 1 to create the TRP1-CYH2 
recombination cassette vector pMXY12. All piasmid constructs were 
10 introduced into bacterial hosts by electroporation and verified by re- 
stncUon analysis, and pMXY9 and pMXY12 were further verified by 
sequencing of all cloning junctions. 

B-lactamase recombination substrates were amplified by preparatory 
PGR from host plasmids (provided by W. Schoenfeld) using the 

15 primer pairs KNS36/KNS37 for Oxa5 (accession X68272), 
KNS7/KNS8 for Oxa7 (accession X75562), and KNS9/KNS1 0 for 
Oxal 1 (accession Z22590). Oxa7 and Oxal 1 PGR products were 
digested with Pad and Iigated to Smal/Pacl-digested pMXY9 io cre- 
ate pMXY13 and pMXY14, respectively, and the Oxa11 PGR product 

20 was also digested with Spei and Pad and Iigated to SpeMf^acX- 
digested pMXY12 to create pMXY22. The Oxa5 PGR products were 
digested with SamHI and Pad and Iigated to Bg/ll/PacWigested 
pMXY9 to create pMXY24. All constructs were verified by restriction 
analysis. 
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Table 3. Plasmids 



Name 


Description or insert 


Source 


pKSII (+} 


Parental vector 


Stratagene 


PRED316 


URA3 and CAN1 source 


R. Sorts 


pJH53 


77?Pf source 


R. Borts 


pMXY1 


5* target 


This work 




3* target 


This work 


PMXY3 


5'taiget-77?Pt 


This work 


pMXy4 


5'target-tyw\3 


This work 


PWIXY5 


C>q/Vf-3' target 


This work 


pMxye 


5'taiget-(y/?/\3 


This work 


pMXY7 


5' tafget-77?P1 


This work 


pMXY9 


5' taiget-l/y?/l3.C>^/Vf-3' target 


This work 


pMXYII 


5' target-TRPW target 


This work 


PMXY12 


5' taigel-7]RPy.CYH2-3' taigel 


This work 


PMXY13 


5' target- W/»3-Oxa7-C/IW/-3' 
target 


This work 


PMXY14 


5' target-(//M30xa1 1-C/Wf-3' 
target 


This work 


pMXY22 


5' target-r/?Pf-Oxa11-cyH2.3' 
target 


This work 








PMXY24 


5" targel-(/FM30xa5-C>Wf-3' 
target 


Thiswoik 



1.4 Recombinant selection and characterigation 



For first round recombination, plasmids bearing recombination cas- 
settes were digested with A/ofI and total digestion products were 
5 used to transform MXY47. Uracil (for pMXY9 derivatives) or trypto- 
phan (for pMXY12 derivatives) prototrophs were selected, and tar- 
geting of one of the two chromosomal copies of the BUD31-HCM1 
locus by the introduced construct was confirmed by colony PGR us- 
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^ ing the primers KNS12/KNS13/KNS15 for URAS-CANI derivatives 
and the primers KNS12/KNS13/KNS14 for TRP1-CYH2 derivatives, 
which allow fragments from the Intact and from the disrupted BUD31- 
HCM1 loci to be amplified. Transformed heterozygotes were 
5 sporulated and tetrad analysis was earned out to identify wild type or 
msh2 haploids bearing recombination cassettes. Appropriate hap- 
loids of opposite mating type were patched on YPD plates, allowed 
to grow overnight, mixed together on the same YPD plate and al- 
lowed to mate overnight The mating plate was replica plated to - 
10 Ura-Trp medium to select for diploids, which were inoculated the fol- 
lowing day in bulk into SPS plus supplements and cultured over- 
night. The preculture was spun down and washed, and the cells 
were resuspended in 1% K acetate plus supplements and incubated 
for two days. 

15 Sporulated cells were harvested, quantified, and in some cases dis- 
sected to confirm appropriate segregation of all markers. Asci were 
digested with zymolyase-20T (ICN Biomedicals) to liberate spores, 
the spore suspension was sonicated (Branson Model 250 Digital 
Sonifier), and appropriate dilutions were plated on YPD to detemnine 

20 cell viability, on uracil dropout media containing 60 ug/ml ca- 
navanine (Sigma) to select Ura+CanR recombinants, and on trypto- 
phan dropout media containing 3 ug/ml cycloheximide (Sigma) to 
select Trp+CyhR recombinants. Spore colonies arising on each me- 
dium were counted and subjected to phenotypic and molecular tests 

26 to determine whether they represented true recombinants. For 
phenotypic analysis, a representative number of candidate 
recombinants was restreaked to the same medium used for selection 
and then replica plated to -Ura, -Trp, cycloheximide (10 jig/ml), 
canavanine (60 jxg/ml), and mating type tester plates. 
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Spores were also plated on - 
Ura-Trp media to determine the frequency of diploids for each spore 
preparation, which in all cases was lower than 4% of total viable 
cells. For molecular analysis, total genomic DNA was subjected to 
5 analytical PGR (see below) using appropriate primer pairs that spe- 
cifically amplify parental or recombinant fragments. The frequencies 
of recombination for a given selection are expressed as the fre- 
quency of viable cells on a given selection medium, corrected for the 
presence of non-recombinants exhibiting a false positive phenotype. 
10 In most cases, such false positives arose by mutational inactivation 
of the CAN1 or CYH2 marker, as suggested by analytical PGR. 

For second round recombination, appropriate recombinants derived 
from the first round of recombination were mated and Ura+Trp+ dip- 
loids were selected. The same sporulation procedure as for first 

15 round recombination was followed, except that spores were plated 
on YPD, on uracil dropout media containing cyclohexfmide to select 
Ura+GyhR recombinants, and on tryptophan dropout media contain- 
ing canavanlne to select Trp+GanR recombinants. Candidate re- 
combinants were similarly subjected to phenotypic and molecular 

20 analysis. 

1.5 Molecular methods 

Genomic DNA used as a template for preparatory or analytical PGR 
was prepared from overnight YPD cultures by a standard miniprep 
procedure according to Ausubel et al. Preparatory PGR of fragments 
25 used in cloning or for sequencing was performed with Oxa plasmid 
DNA (approximately 50 pg) or yeast genomic DNA (approximately 
0.5 ixg) as a template in 50 \xl reactions containing 2.5 U Herculase 
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polymerase (Stratagene), 1x Herculase reaction buffer, 0.2 mM each 
dNTP and 100 ng each primer. Amplification was carried out as fol- 
lows: 94" C 2 min; 30 cycles of 94*'C 10s, SS^C 30 s, 72?C 30s; 68"C 
10 min. A modified colony PGR procedure was employed to confinn 
5 integration of recombination cassettes at the BUD31 locus 
fhttp://www.fhcrc.ora/labs/hahn/methods/mol bio meth/ocr veast c 
olonv.html) . with the following amplification conditions: SS^C 5 min; 
35 cycles of 95*'C 1 min, 55«»C 1 min, 1 min; 72?C 10 min. Ana- 
lytical PGR to characterize Oxa inserts was carried out in 100 ^1 re- 

10 action volumes containing approximately 0.5 ^g genomic DMA pre- 
pared from candidate recombinants and control strains, 1.5 U Taq 
polymerase (Roche), 1 x reacUon buffer, 0.2 mM each dNTP and 
100*ng each primer, with the same amplification conditions as for 
colony PGR, except that extension was carried out at 68"G for 2 min. 

1 5 Ail amplification reactions were performed with a Masterc^cler gradi- 
ent 5331 (Eppendorf). 

For sequence analysis of recombinant Oxa inserts, preparatory PGR 
was carried out with the primer pairs KNS16/KNS29 (for Ura+GanR 
recombinants) or KNS1iyKNS28 (for Trp+GyhR), followed by purifi- 

20 cation with the Qiaquick PGR kit (Qiagen). PGR products were se- 
quenced by Genome Express (Meylan, FR) with the primers KNS30, 
KNS31, KNS33 or KNS38, as appropriate. Recombinant sequences 
were aligned and analyzed using Glone Manager software (Sci Ed 
Gentral). Oligonucleotides used in PGR and sequencing (Table 3) 

25 were purchased from Proligo France. 

2. Results 
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2.1 Development of a veast meiotic homeoloaous recombination 
system 

A strategy that makes use of the yeast Saccharomyces cerevisiae to 
promote in vivo recombination between diverged DNA sequences 
5 has been developed. Critical features of the strategy include the use 
of meiotic cells, in which high levels of genome-wide recombination 
take place, and inactivation of the mismatch repair (MMR) system, 
which normally restricts recombination between diverged sequences. 
Sequences to be recombined, i.e. the recombination substrates, are 

10 introduced into one of two vectors that also bears flanking marker 
sequences, so as to create recombination cassettes. The recombi- 
nation cassettes are introduced into the yeast genome, at a locus on 
chromosome III (the BUD31-HCM1 interval), which is in a region 
known to be recombinationally active in meiosis. Diploids heterozy- 

15 gous for recombination cassettes are sporulated, and spores are 
plated on media that select for cells with specific configurations of 
flanking markers, thereby allowing for the selection of recombinants 
in which a crossover involving recombination substrates has taken 
place (Figure 1). 

20 Two general recombination cassette vectors were constructed, 
pMXY9 and pMXY12, which contain the URA3 and CAN1, and the 
TRP1 and CYHZ markers, respectively, flanking restriction sites that 
can be used for the introduction of recombination substrates (Figure 
2). The URA3 marker confers uracil prototrophy, and the CAN1 

25 marker confers canavanine sensitivity. In the absence of this marker, 
cells are resistant to the dmg. The TRP1 marker confers tryptophan 
prototrophy and the CYH2 marker confers cycloheximide sensitivity. 
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In the absence of this marker, cells are resistant to the drug. Each of 
the two recombination cassettes Is in tum flanked by sequences that 
allow targeting of the entire Insert to the BUD31-HCM1 locus by 
transformation of competent cells (Figure 2). A strain that serves as 
5 a primary host for transformation, MXY47, was also constructed (T a- 
ble 2). This diploid is heterozygous for the msh2::KanMX mutation, 
and is phenotypically wild type with respect to MMR. It is also homo- 
zygous for the uraS-l, trp1-1, canl-100 and cyh2R markers, which 
allows the presence of recombination cassette markers to be moni- 

10 tored, and heterozygous for the his3-11,15 and leu2'3,112 markers. 
MXY47 is transfomned with fragments bearing recombination cas- 
settes, primary transformants are selected as Ura+ or Trp+ proto- 
trophs (for MXY9 and MXY12 derivatives, respectively), and target- 
ing is confirmed by analytic PGR using primers that recognize se- 

15 quences within and external to the introduced construct. Primary 
transformants are sporulated, and tetrads are dissected and replica 
plated to identify wild type or msh2 segregants that bear the recom- 
bination cassette. Suitable haploids are mated to one another to 
generate MSH2/MSH2 (wild type) and msh2/msh2 diploids het- 

20 erozygous for recombination substrates. In a first round of meiotic 
recombination to generate recombinants, these diploids are 
sporulated, and free spores are plated on media lacking uracil and 
containing canavanine to select for recombinants with the URAS- 
CYH2 configuration of flanking markers (Ura+CanR spore colonies), 

25 or on media lacking tryptophan and containing cycloheximide to se- 
lect for the TRPI-CANI configuration (Trp+CyhR spore colonies). 
Parental diploids and non-recombinant haploid progeny cannot grow 
on these media. The frequency of spore colonies arising on selective 
media is detemiined, candidate recombinant spore colonies are 
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Characterized phenotypically by replica plating on test media and 
molecularly by PGR with appropriate primer pairs, and a sample of 
confimied recombinants is selected for sequencing. 

The strategy is iterative in that cells bearing recombinant inserts can 
5 be identified and subjected to further rounds of meiotic recombina- 
tion to increase diversity. In a second round, Ura+CanR and 
Trp+CyhR haploids are mated, and the spomlation and selection 
process is repeated, except that new recombinants are selected on 
media lacking uracil and containing cycloheximide. to select for re- 

10 combinants with the URA3-CAN1 configuration of flanking markers 
(Ura+CyhR spore colonies), or on media lacking tryptophan and con- 
taining canavanine to select for the TRP1'CYH2 configuration 
(Trp+CanR spore colonies). The strategy detailed here can also be 
modified to include additional markers to increase the stringency of 

15 selection. Furthermore, recombinants can also be directly selected 
by PGR using primers specific to flanking sequences. 

2.2 PhenotvDic selection for recombination between Oxa oene 

pairs of varvina sequence divergence 

Genes belonging to the Oxa superfamily of beta-lactamases were 
20 chosen as substrates to test the feasibility of the system for the se- 
lection of recombinants. Recombination between the following Oxa 
pairs was assessed in the wild type and msh2 backgrounds: Oxal 1- 
Oxa11, which share 100% homology throughout the 800 bp ORF; 
0xa7-0xa11, 95%; 0xa5-0xa11, 78%. Diploids generated by 
25 crosses between appropriate haploids were induced to enter meio- 
sis. Spores were prepared from meiotic cultures, and serial dilutions 
were plated on YPD to determine cell viability and on medium lack- 
ing uracil and containing canavanine (-Ura+Gan) and on medium 
lacking tryptophan and containing cycloheximide (-Trp+Gyh) to se- 
30 lect for recombinants. 
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2.3 Frequencies of recombination between Oxa genes of varying 
seouence homoloov 

The data shown in Figure 3 demonstrate that in the wild type back- 
ground, increased sequence heterology has a strong inhibitory effect 
5 on crossover recombination, and that this effect is relieved but not 
abolished by the msh2 mutation. In general, the msh2 mutation 
causes an increase in the frequency of recombination of about one 
order of magnitude above that observed for wild type strains at the 
two levels of divergence tested. However, inactivation of MSH2 

10 alone does not fully compensate for the inhibition of recombination 
between recombination substrates with higher degrees of heterol- 
ogy. For example, the frequencies of recombination for a msh2 strain 
with Oxa Inserts sharing 78% homology (MXY102) are at least 10- 
fold (Ura+CanR) and 25-fold (Trp+CyhR) below those found for a 

15 wild type strain with Oxa inserts of 100% homology (MXY60), indi- 
cating that factors other than AfSH2-dependent mismatch repair pre- 
vent crossover recombination between more diverged sequences. It 
is noteworthy that the appearance of msh2 recombinants at the 78% 
divergence level, at frequencies of roughly 2 x 10"^, indicates that 

20 recombination may be achieved between even more divergent sub- 
strates. 

2.4 The msh2 hvoerHrecombination effect 

The effect of msh2 on homologous and homeologous recombination 
25 was quantified by first calculating the ratio of msh2 to wild type 
recombinants for a given percent of homology for a given selection 
for each experiment, and then calculating the means and standard 
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deviations of the ensemble of ratios thus determined. The data are 
shown in in Figure 4. The presence of the msh2 mutation increases 
the frequency of homeologous recombination for sequences of 95% 
and 78% homology, and there is a less pronounced but still quantifi- 
5 able enhancement of recombination between 100% identical se- 
quences. Furthermore, the extent of the msh2 enhancement of ho- 
meologous recombination differs for the two selections: for strains 
with homeologous Oxa inserts, inactivatlon of MSH2 increases the 
frequency of Trp+CyhR recombinants to a greater extent than it in- 

10 creases the frequency of Ura+CanR recombinants. In principle, the 
frequencies of both types of recombinants (Ura+CanR and 
Trp+CyhR) should be equivalent, but these numbers indicate that 
there are biases in the system that are provoked or enhanced by the 
msh2 mutation, in conjunction with variations in the extent of se- 

15 quence divergence. Experiments to test the relative influences of 
inserts and flanking marker sequences on the types of recombinants 
obtained indicate that this bias is a property of the flanking markers 
but the influence of the recombination substrates in directing the out- 
comes of meiotic recombination events cannot yet be accounted for 

20 (data not shown). 

2.5 PGR analvsis of selected recombinants 

An example of PGR analysis, as applied to Ura+GanR and 
Trp+GyhR spore colonies derived from wild type and msh2 diploids 
25 containing Oxa genes of 22% divergence (MXY99 and MXY102, 
respectively) is shown In Figure 5. For each strain, ten spore colo- 
nies that exhibited each recombinant phenotype were analyzed. 
Extracts from each colony were used as templates for amplification 
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with primer pairs that specifically amplify parental molecules and 
with primer pairs that specifically amplify recombinant molecules. 
In every case(s), only the predicted recombinant insert was ampli- 
fied, indicating that the selected spore colonies contained se- 
5 quences produced by recombination between the parental Oxa re- 
combination substrates. These results also demonstrate that recom- 
binant molecules can be directly recovered from sporulated cultures, 
even without the imposition of a genetic selection step. Here, the 
primer recognition sites located in the URA3, CAN1, TRP1 and 
1 0 CYH2 genes represent molecular marker sequences that flank each 
recombination substrate. 

2.6 Secuence analvsis of first round meiotic homeolooous recom- 
binants: 5% and 22% divergence 

0xa7-0xa11 meiotic recombinants derived from wild type and msh2 
15 diploids (MXY64 and MXY66, respectively), which contain recombi- 
nation substrates sharing 95% homology, that satisfied phenotypic 
and molecular (PCR) tests were subjected to sequence analysis. 
Recombinant fragments were amplified with primers specific to flank- 
ing markers and sequenced using primers close to the translational 
20 start and stop sites. Overall, 55 recombinant sequences derived from 
haploid progeny of 0xa7-0xa 11 diploids were analyzed: 14 
Ura+CanR and 13 Trp+CyhR recombinants from MXY64, and 14 
Ura+CanR and 14 Trp+CyhR recombinants from MXY66. The se- 
quenced sample size allows several observations to be made. 1 ) For 
25 both wild type and msh2 recombinants, the position at which the 
crossover took place ranged throughout the full coding region, with 
no apparent preference for a specific interval. Crossovers that oc- 
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curred in the 5' region were as likely as those in the 3* region. Also, 
for a given strain, there was no apparent difference in the distribu- 
tions of crossover sites for spore colonies obtained by -Ura+Can or 
by -Trp+Cyh selection. 2) The length of unintenrupted homology in 
5 the crossover interval was also unimportant: crossovers were de- 
tected between two closely spaced polymorphisms (positions 543- 
552 for MXY66 Trp+CyhR #7, #8, and #13, where position 1 repre- 
sents the adenosine residue of the ATG translational start site) as 
well as between the two most widely spaced polymorphisms (posi- 

1 0 tions 1 63-265, eg MXY66 Trp+CyhR #1 5). 3) The recombinant Oxa 
inserts isolated from both wild type and msh2 backgrounds con- 
tained full-length recombinant sequences potentially capable of en- 
coding new, functional Oxa proteins. That is, all crossovers occurred 
in such a manner as to preserve an Intact ORF, without a net inser- 

15 tion or deletion of nucleotides in the crossover interval or in any 
other interval. 4) Although the structures of most recombinant se- 
quences are consistent with a simple crossover between the two 
Oxa sequences in local regions of homology, several recombinants 
isolated in the msh2 background exhibited greater complexity. Se- 

20 quences derived from four recombinants (MXY66 Ura+CanR #16 
and #31, and MXY66 Trp+CyhR #5 and #9) exhibited a higher de- 
gree of mosaicism, as if they were produced by more than one 
crossover event. Indeed, analysis of two of these recombinants was 
complicated because inspection of electropherograms revealed the 

25 presence of two overlapping peaks at multiple sites within the se- 
quenced region, each site conresponding to an Oxa 7-0xa11 poly- 
morphism. This obsen^ation indicates that the population of mole- 
cules that was sequenced was heterogeneous, for which the most 
likely explanation Is the presence of unrepaired or partially repaired 
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heteroduplex DNA present in msh2 recombinant spores. This 
interpretation is consistent witli the l^own increased frequency of 
post-meiotic segregation (PMS) caused by the msh2 mutation. In 
these two cases, MXY66 Ura+CanR #16 and #31, one or more 
6 repaired sites was flanked by stretches of unrepaired heteroduplex, 
consistent with the unmasking of a short patch mismatch repair 
activity in the msh2 background, as suggested by CoTc, G\uck and 
Fabre (EMBO J. 19:3408). Several other cases of PMS unassociated 
with short-patch mismatch repair were also observed for msh2 

10 recombinant sequences, indicating that this alternative mismatch 
repair system may not be highly efficient at correcting mismatches in 
heteroduplex DNA. Judging from sequence eleclropherograms, the 
extent of uncorrected heteroduplex varied, from a short region of 
about 50 nt to a region almost covering the entire ORF. No evidence 

15 for PMS or short-patch mismatch repair was found for wild type 
recombinant sequences. Overall,, these findings suggest that the 
extent of diversity created is greater in msh2 meiosis than in wild 
type meiosis. 

Melotic recombinants were also derived from wild type arid msh2 
diploids that contain recombination substrates sharing 78% homol- 

20 ogy, (MXY99 and MXY102, respectively), in total, 24 recombinant 
sequences derived from recombinant progeny of 0xa5-0xa 11 dip- 
loids were analyzed: five Ura+CanR and three Trp+CyhR recombi- 
nants from MXY99, and nine Ura+CanR and seven Trp+CyhR re- 
combinants from MXY102. Inspection of these sequences suggests 

25 several trends. 1) Recombinant Oxa sequences obtained In both 
wild type and msh2 strains by selection on -Trp+CyhR exhibited 
crossovers at different positions throughout the ORF, with perhaps a 
slight tendency towards the middle 250 bp region (nt 333-nt 573) of 
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overall shared homology. In contrast, recombinants obtained in both 
wild type and msh2 strains by selection on -Ura+CanR exhibited a 
pronounced bias in the positions of crossovers: in 3 of 5 wild type • 
and 8 of 9 msh2 sequences, crossovers occurred within the last 80 
5 nt of the region of shared homology, ie, the last 10% of the ORF. 2) 
The intervals of absolute homology in which crossovers were identi- 
fied ranged from 11 to 20 nt for the -Trp+Cyh selection, indicating a 
preference for these relatively larger regions of sequence identity. In 
contrast, crossover intervals were shorter for the -Ura+Can selec- 

10 tlon, ranging from 3 to 17 nt (13/14 of these Involved intervals 13 nt 
and shorter). 3) As for recombination involving sequences sharing 
95% homology, the new sequences obtained also consisted of intact 
ORFs and potentially encode novel Oxa proteins. 4) No cases of 
PMS, as judged by inspection of eledropherograms, were found for 

15 wild type recombinants, but very short patches of unrepaired hetero- 
duplex were found for a few m$h2 recombinants, including 3 of the 7 
Trp+CyhR recombinants. These regions included at most 67 nt, 
shorter than some of the tracts observed for 0xa7-0xa1 1 recombi- 
nants. In sum, these observations indicate that recombinant se- 

20 quences can be selected from input recombination substrates vary- 
ing by at least 22%, and that these sequences encode novel pro- 
teins. 

2.7 Sequence analvsis of 0xa7-0xa11 second-round recombi- 
nants 

25 The ability of the yeast system to increase sequence diversity in an 
iterative manner was tested by . constructing diploids from Oxa7- 
Oxa1 1 recombinant hapioids generated in a first round of meiosis 
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and subjecting these new diploids to a second round of meiosis. 
Among tlie sequenced Ura+CanR and Trp+CyhR progeny of MXY64 
and MXY66, pairs of appropriate recombinants with crossovers in 
the same interval were selected to construct new diploids in which 
5 the overall level of sequence homology was again 95%. Three wild 
type (MXY81, MXY82 and MXY83) and three msh2 (MXY86, MXY87 
and MXY88) diploids were created. Control wild type and m$h2 dip- 
loids containing only Oxa1 1 sequence inserts were also constructed 
from appropriate recombinant progeny of MXY60 and MXY62, yield- 

10 ing MXY90 and MXY92. These diploids were sporulated and spores 
were plated on medium lacking uracil and containing c^cloheximide 
(-Ura+Cyh) and on medium lacking tryptophan and containing ca- 
navanine (-Trp+Can) to select for secondnround recombinants. As 
shown in Figure 6, the frequencies of -Ura+CyhR and Trp+CanR 

15 spore colonies observed for all of these strains is consistent with the 
anti-recombination effect of the MSH2 gene. Both types of colonies 
were found among progeny of the wild type homozygote MXY90 at 
frequencies above 1 0"^, whereas these frequencies were decreased 
5- to 10-fold among progeny of wild type diploids with Oxa insert 

20 heterology (MXY81. MXY82 and MXY83). Inactivation of the MSH2 
gene in diploids with diverged Oxa inserts (MXY 86, MXY 87, and 
MXY 88) led to a 2 to 6-fold increase in the frequency of Ura+CyhR 
and Trp+CanR spore colonies, similar to the levels seen for a msh2 
diploid bearing identical Oxa inserts (MXY92). Although the media 

25 used differ from those used for selection of first-round recombinants, 
the frequencies at which wild type and msh2 second-round recombi- 
nants were selected are comparable to those for first-round recom- 
binants. 
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Ura+CyhR and Trp+CanR spore colonies In both the wild type 
(MXY81 and MXY83) and msh2 (MXY86) backgrounds were se- 
lected for sequencing. In all, 14 wild type and 7 msh2 Oxa inserts 
were sequenced. In most cases, a crossover occurred in a novel in- 
5 terval during second round recombination, again without apparent 
bias with respect to position or interval size: crossovers involving 
different intervals were found throughout the Oxa ORF and they oc- 
curred in intervals as large as 101 nt and as small as 5 nt In one 
case (a MXY83 Trp+CyhR haploid), a second round crossover oc- 

10 cunred in the first round crossover Interval, thereby restoring a full 
Oxal 1 sequence. Recombinants recovered from msh2 diploids were 
more diverse than those recovered from wild type strains. For the 
msh2 diploid MXY86 several spore colonies exhibiting extensive 
PMS and sequence mosaicism were observed, consistent with the 

16 fonmation of long tracts of heteroduplex in the recombinatlonal inter- 
mediate. Furthenmore, some mismatches were repaired in the hete- 
roduplex tract, again consistent with a short-patch mismatch repair 
activity. In sum, second-round recombination in the msh2 back- 
ground is as efficient as first-round recombination, both qualitatively, 

20 with respect to generating sequence diversity (eg., crossover interval 
distribution and incidence of PMS), and quantitatively, with respect 
to increasing the overall frequency of homeologous (5% divergence) 
recombination. 



